Gradient-based learning of higher-order features
نویسنده
چکیده
We describe an auto-encoder with multiplicative connections whose hidden variables encode products of pixel intensities. The model allows for efficient learning of image transformations and of higher-order structure within an image-patch. Modelling higher-order structure can be an effective way to improve recognition performance, as shown recently with a similar, probabilistic model of image correlations. In contrast to probabilistic models, auto-encoders can greatly simplify learning and inference, and they allow us to construct multi-layer processing hierarchies containing various pre-processing modules, that can be trained with simple back-propagation. We consider the task of learning higher-order image features, such as pair-wise products between pixel intensities. Unsupervised learning of pairwise products across images was considered by [6] as a means to encode image transformations. It has been extended and applied to various tasks, such as human action recognition [10], transparent motion estimation [7], foveal glimpse combination [5] and others. The model has also been extended to encode within-image pixel products, and was shown to yield state-of-the-art results in various object recognition tasks [8], [9]. An application of that model to language recognition has furthermore been shown to yield the best known result on the well-known TIMIT benchmark [1]. The models are based on hidden variables that are connected to observables (typically two images) in a three-way energy function, which, by exponentiating and normalizing, is turned into a probability distribution [6]. Learning amounts to approximate maximum likelihood, using a variation of contrastive divergence learning [3]. Learning is complicated somewhat by the fact that the basic model is defined using binary observations. Real-valued images are treated specially, and are known to require careful learning-rate control and normalizations to prevent “blow-up”. [9], [8] alternatively suggest using a variant of Hybrid Monte Carlo and various constraints on the model parameters to address these issues. In this work we describe an approach to learning higher-order features using simple gradient-based optimization. Our model is an auto-encoder modeling an image patch y. Connections between observable and hidden variables are multiplied by pixels in another image patch, x, forcing the hidden variables to encode pixel correlations rather than pixel means. The model is a type of higher-order neural network [2] applied to modeling feature correlations. Training amounts to simple back-propagation. This makes it straightforward to incorporate various pre-processing modules, such as low-dimensional projections to reduce computational complexity. In contrast to [7], there is no need for tensor factorization, or even for the manual calculation of gradients, for this purpose. Binary, real-valued and other types of data are dealt with in the usual way by deploying the appropriate activation/cost-functions in the final layer of the network. One can add standard hidden variables encoding first-order structure to the model, if desired, in which case the overall model may be called a “mean-covariance-encoder”. Figure 1 shows an illustration of a particular instance of the model, which can be viewed as non-probabilistic analog of the model described in [7]. The network takes two data-cases, x and y, as input, and projects these onto features (left-most vectors of triangles in the figure). These features are multiplied to provide the activations for hidden units. Reconstructions for y are computed similarly, by combining, multiplicatively, the vector of covariancevariables h with the (known) vector x. This defines a “conditional” model of y as a function x, similar to [6]. The
منابع مشابه
ارائه یک رویکرد فازی برای بهینهسازی پیشبینی سری زمانی با مرتبه بالا
It is difficult to apply the real world’s conceptions due to their uncertainty. Generally, time series are known to be non-linear or non-stationary. Regarding these two features, a system should be sensitive enough to apply the unity of time series and repeat this sensitiveness in the prediction. A predict system can exactly scrutinize the hidden features of time series and also can have high p...
متن کاملA Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features
Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...
متن کاملDetermining Effective Features for Face Detection Using a Hybrid Feature Approach
Detecting faces in cluttered backgrounds and real world has remained as an unsolved problem yet. In this paper, by using composition of some kind of independent features and one of the most common appearance based approaches, and multilayered perceptron (MLP) neural networks, not only some questions have been answered, but also the designed system achieved better performance rather than the pre...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملA Novel Fuzzy Based Method for Heart Rate Variability Prediction
Abstract In this paper, a novel technique based on fuzzy method is presented for chaotic nonlinear time series prediction. Fuzzy approach with the gradient learning algorithm and methods constitutes the main components of this method. This learning process in this method is similar to conventional gradient descent learning process, except that the input patterns and parameters are stored in mem...
متن کاملImproving the Operation of Text Categorization Systems with Selecting Proper Features Based on PSO-LA
With the explosive growth in amount of information, it is highly required to utilize tools and methods in order to search, filter and manage resources. One of the major problems in text classification relates to the high dimensional feature spaces. Therefore, the main goal of text classification is to reduce the dimensionality of features space. There are many feature selection methods. However...
متن کامل